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1. APPLICATIONS AND REPORTS ON RESEARCH 


@. INFORMATION RETRIEVAL RESEARCH IN THE U.S. PATENT OFFICE* 





Totroduction 


In its mission of issuing patents for inventions, the U. S. 
Patent Office is required by law (Rl) to reject applications for 
patents which do not, inter alia, claim novel subject matter. The 
Office has a professional staff of scientists, each of whom 10 
examines the applications in a small sector of science and tech- 
nology (the so called prior art). The first step in the examina- 
tion is a search of the prior art to locate any document, or com- 
bination of documents, which discloses any subject matter from 
which the examiner can evaluate the question of novelty pre- 
sented by the application. The documents so obtained are known 
as references. Of course, the other requirements in addition to 
novelty which govern issuance are also considered during examina- 
tion. The examining corps spends approximately 60% of its time 
merely searching (as opposed to the remaining steps, including 
evaluating the retrieved references) in what is one of the larg- 
est information retrieval operations in existence. 


In 1954, in its consideration of a Department of Commerce 
budget request, the U. S. Senate Appropriations Committee (R2) 
directed that the Department 


"... make an aggressive and thorough investigation as 
to the possibility of mechanizing the searching operations 
eee in order to modernize, insofar as practical, the Patent 
Office operations." 


The Secretary of Commerce appointed an Advisory Committee, which 
recommended in part (R3): 


35 
"..e- A Research and Development unit should be estab- 
lished in the Patent Office.... The National Bureau of Stand- 
ards and the Patent Office should undertake a joint program 
to stimulate and develop ... techniques specifically adapted 
to Patent Office operations....” 


On January 31, 1955, a small unit was formed within the 
Classification Operations of the Patent Office, and on April 8, 





*The content of this article is in the public domain. 
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1956, this matured into the Office of Research and Development, 1 
directly under the Office of the Commissioner of Patents. On 
August 3, 1957, Mechanized Examining Division A was created under 
the Office of Research and Development to handle the automated 
search of the first sector of the prior art which had been pre- 5 
pared for search mechanization. 


Our Problems of Retrieval 





It has been stated many times that the very discriminating 10 
search which the Patent Office makes is different from the types 
of searches made by others for information. It is our studied 
opinion that this is not true, that our searching and other search- 
ing differ merely in degree, and not in kind. Others have either 
been afraid to hope for such detailed retrieval, or have aban- 15 
doned it as visionary. But we have been retrieving by manual 
methods with this degree of detail for years, and research scien- 
tists, who have taken the trouble to come to Washington to use our 
search files, have uniformly been surprised at the level of detail 
which has been provided. 20 


Successful mechanization of a manual system does not neces- 
sarily consist in mechanically performing the same steps in pre- 
cisely the same way. The mechanized horse and surrey is not a 
tractor-trailer unit, but an automobile; the domestic coak 25 
stoker feeds coal from the bottom of the furnace bed, many con- 
mercial coal burning furnaces pre-grind the coal, mix it with air 
and blow it into the firebox, while manually fired furnaces are 
stoked by adding coal on top of the fire bed. Successful mechan- 
ized retrieval systems may thus prove to be substantially dif- 30 
ferent in operation from the conventional search systems which 
they supersede. 


Patents and other documents written in expository prose, even 
when referring to the same phenomenon, will use terms on dif- 35 
derent levels of generality and specificity. An article, device, 
method, or other phenomenon may be described by any of a number of 
different facets or aspects of its nature. To further complicate 
this situation, such facets or aspects usually occur on different 
levels of generality. Furthermore, much significant informe- ho 
tion in technical prose is present by implication, rather than 
explicity expressed. 


As a simple illustration of two levels of generality, a dog, 
cat, parakeet, and guppy are each "vertebrates", but only the 45 
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first two are "mammals". However, to some of us, a dog, cat, 1 
parakeet, and guppy may also belong to the genus "pet". Sime- 
taneously, to others, the same dog may belong to the genus "burg- 
lar protection", together with a padlock and a policeman. This 

cat also belongs to the genus "rodent destroyers”, together with 5 
a@ mousetrap and arsenic. This illustrated facets or aspects at 

two levels of generality, but with the same species subsumed from 
different genera. 


Implicit disclosure is evidenced, for example, by specifi- 10 
cation of the liquid state of a compound which inherently suggests 
the existence of a freezing and a boiling point, a specific 
gravity, and other physical properties. If the compound is a dye, 

a color is implied; if a pharmaceutical, a disease or other bodily 
malfunctioning and a process of treatment or cure or both are 15 
implied; if an explosive, a method of and means for both priming 
and igniting; etc. 


In tne process of retrieving similar and analogous subject 
matter, the ability to specify interrelationships among units 20 
of subject matter (articles, devices, method steps, and other 
phenomena) enhances precision of the search and reduces the in- 
cidence of false or unwanted answers. The utility of interrela- 
tionship is seen in contrasting the information value of parts 
lists and circuit diagrams for television and radar sets. An 25 
expert electronics engineer, if shown only the two parts lists, 
would have difficulty in identifying the list with the appro- 
priate device, but when presented with only the circuit diagrams 
even without values for the individual components, he could im- 
mediately distinguish between the two because of the interrela- 30 
tionships of the components shown. 


The user of a retrieval system for any file of documents can- 
not, of course, know whether a document discloses or implies the 
desired subject matter, or, if present, the level of generality 35 
and the facets or aspects by which it has been described. To be 
of optimum value, therefore, a system should provide for each of 
these factors in its file preparation and its searching strategy. 


Our Research Program Lo 





The Patent Office and the National Bureau of Standards, 
jointly, separately and with outside contractors (including the 
Bureau of the Census), have been and are still engaged in a series 
of programs which range from actual productive mechanized 45 
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searching of a limited group of chemical compounds to explora- 1 
tory research in information retrieval theory including mathe- 
matics and linguistics. Most of the early research effort was 

in the field of chemical subject matter. The scope was later 
broadened to include studies of problems posed by language and 5 
its ambiguities and by disclosures in the mechanical and elec- 
trical arts. Our own studies have tended to emphasize problems 
having immediate application to the patent examining operation 

over those having projected applications of broader scope. 





10 
Our first attempt at automation of patent searching ante- 

ted the Congressional mandate. This research was done in 1947 
in the Chemical Division of the Classification Group of the Patent 
Office. A mechanized search was actually attempted in 1947, but 
the proposed system was not sufficiently sophisticated for pat- 15 
ent searching. Other systems were conceived and studied, and in 
1950, an operative system was tested (R4). After the test, work 
was suspended for administrative reasons until 1955. 


The reconstituted research and development group began 20 
with a reappraisal of this first attempt followed by work on 
systems design. Subsequently, several of the new group began a 
cooperative program with the Data Processing Systems Division of 
the National Bureau of Standards. This cooperative program, chris- 
tened HAYSTAQ, is an experimental effort designed for long range 25 
studies to develop a completely comprehensive search routine for 
a large Data Processing Computer (R5). Although specifically 
tailored to chemical disclosures, general applicability of the 
system to any topological network is planned. The system has been 
designed to provide for searching for compounds, compositions 30 
of matter (i.e., mixtures of compounds), processes of manufacture 
and use, and other related matters. In the construction of the 
computer program, all of the desired limitations of a patent search 
have been considered, so that this program may later be extrapo- 
lated to eventually retrieve substantially all information ina 35 
publication. Currently, a comprehensive search file of chemical 
compounds in one segment of the chemical arts is being prepared for 
use in a first approximation of the desired type of search. 


Another early effort (R6) was an analysis of the examiner's 40 
duties in patent examination with particular emphasis on the pro- 
blems entailed in mechanization of search procedures. 


By contrast with these early projects, the amenability of the 


structures of chemical compounds to routinized designations has 45 
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facilitated the development of several immediately usable systems 1 
for the small areas of subject matter to which they apply. Perhaps 
the most spectacular is the coding system for steroid compounds by 
which many variations of the present structure have been represent- 
ed in generic and specific aspects (R7). Steroid chemistry con- 5 
stitutes one of the more complex organic chemistry arts. The cod- 
ing system which was devised is a relatively unsophisticated method 
by which the steroid compounds content of the approximately 1600 

U. S. patents in this art was entered into a series of punched 

cards with flexibility. in generality sufficiently detailed for 10 
productive search. Machine search of the punched-card file as 

part of the examination of patent applications in this art conm- 
menced in September 1957, in the Mechanized Examining Division. 
From that time through May 1960, approximately 14,000 mechanical 
searches have been made by examiners in this division. This sys- 15 
tem has been widely acclaimed and publicized; one measure of its 
value is indicated by the undertaking of the Pharmaceutical Manu- 
facturers Association to finance a literature supplement to the 
patent file. A method for extracting and preparing the steroid 
data for machine input was established which eliminates, as far 20 
as possible, manual repetition and transcription of data, thus 
conserving time, space, and professional effort. 


In conjunction with early work on the steroids, a limited 
topological search for organic compounds, conceived in the Patent 25 
Office, was programmed by National Bureau of Standards personnel 
(R8) for their SEAC computer. While adequate to minimal needs for 
steroid compounds, it proved cumbersome and slow, and was almost 
immediately superseded by the present productive system. 

30 

A system, known as the VS- system (R9), developed for serially 
searching all known types of py As pees structures, subsequently ser- 
ved as a springboard for several current projects. In this system, 
generic and subgeneric terminology are expressed in terms repre- 
sentative of chemical structure diagrams, i. e., a generic term 35 
is applied to a chemical structure if a substructure or "fragment", 
defined by that term, is a part of the structure so diagrammed. 

Each term is assigned a distinctive code, which is further modified 
by an additional coding, called an “interfix", representing dia- 
grammed connectivity between the fragments. 4O 


This general scheme was later redesigned for use in a random 
access approach, in which the stored information is somewhat loose- 
ly called an "inverted file" (R10). Basic principles applied in 
this new approach include developing genus-species hierarchies 45 
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from the extracted raw data of the document, rather than by in- 1 
tuition or preconception; the ability to manipulate and modify 

the file without altering encoded data to changes in classifica- 
tion and relationship on the generic level; and, when utilized in 

a@ retrieval operation, constructing both the classification hier- 5 
archy and the variant and invariant relationships from the sub- 
ject-matter file without dependence on document identification. 


One of the descendants of this modified VS. system is pro- 
ject RAMP, in which the modified VS3 approach has been adapted 10 
to compounds containing the phosphorus acid ester structure (R11). 
The complexities arising in this latter system from a more variant 
structure are manifest in greater generality of descriptors and 
interrelationships. This system is now in pilot operation, and 4 
search file is being created for productive use. 15 


Another descendant is project DRAM, in which the inverted 
file technique of the modified VS. system is being extended to in- 
clude non-structural descriptive matter pertaining to chemical 
behavior and enviroment. 20 


In a recently started project, electrical systems components 
and fragments of such systems have been identified with distinc- 
tive codes at several levels of generality. To these concepts, 
additional descriptive terms have been developed for functions 25 
and other identifiable characteristics. 


Other means of standardizing terminology for retrieval which 
are being studied include a study of symbolic and graphical systems 
of representing word and diagrammatic disclosures of mechanical 3 
subject matter (R12). 


The difficulties involved in being able to represent a concept 
in any technical discipline in unique, precise, and unambiguous 
terminology led to more basic research into linguistics. Al- 35 
though the amount of effort applied in this direction has been 
limited, many of the problems imposed by popular language usages 
have been stated (R13). Some suggestions on studies leading to 
their resolution have been proposed, and exemplary steps taken, 
including the formation of one possible solution, a metalan- Lo 
guage, dubbed "Ruly English” (R14). Concomitantly, through con- 
tracts made by the National Bureau of Standards, research is going 
forward on using raw text as a file in information retrieval (R15). 
Creation of a file of U. S. Patents in machine-readable form for 
this research required a complex transliteration scheme (R16). ks 
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The file is now in preparation. 1 


In conjunction with research on retrieval systems, some study 
has been made of hardware for processing the data. To test retrie- 
val systems utilizing network representations, radical modifica- 5 
tions were made of the Census type Multi-Column Sorter (similar to 
the IBM 101) in a joint effort with the Bureau of the Census of the 
Department of Commerce (R17). The resulting machine, named the 
ILAS (Interrelated Logic Accumlating Scanner), is programmed in 
part by setting subject matter codes and in part by designating 10 
interrelationships at different logical levels among the subject- 
matter codes. The built-in ability of the machine to make a vari- 
ety of correlations is used for rapid testing of the logical design 
of coding systems. Existing equipment is also being used or has 
been evaluated for testing our retrieval systems. These include 15 
the National Bureau of Standards Computer SEAC for which programs 
are being written to test HAYSTAQ; the IBM RAMAC 305 computer which 
has been programmed to test "inverted-file” type arrangements of 
subject matter (R18); the Bendix G-15 computer which was programmed 
to evaluate the use of weighted values imposed on the subject 20 
matter codes (R19); and the "Peek-a-Boo" card (Batten-type card) 
equipment of the Office of Basic Instrumentation, National Bureau 
of Standards. 


Future Directions 25 





Several factors make our future course unclear, a situation 
probably shared by other research groups. Preliminarily, continued 
progress on the retrieval systems currently being devised may dis- 
close areas of emphasis not presently contemplated, or need for 30 
other unconsidered modifications. Funding and staffing are also 
factors not without significance. Additionally, a systems-engi- 
neering study of our operation is being planned by the Department 
of Commerce as a result of a Department-wide evaluation of its role 
in science and technology research (R20). 35 


The creation of productive but limited search systems in small 
areas of art, when well considered, may be helpful in extrapolating 
both the sophistication of the system and the size of the area in 
each succeedingly developed system. Our experience in obtaining 40 
small-scale files for pilot tests strengthens our conviction of the 
need of studies into methods for preparing large encoded document 
files, including abstracting methods, in which mechanization should 
be considered at all possible steps. Development of screening tech- 
niques for serial systems and other methods of shortening search 45 
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time also need consideration. 1 


Concomitantly, other more basic studies seem necessary for 

the resolution of problems which separate us from a universal solu- 
tion to the patent-search problem. Subject areas which might com- 5 
plement and carry forward present findings could profitably include 
mathematics and the formulation of a mathematical theory of retrie- 
val; structural and transformational linguistics; programming stud- 
ies for learning and self-organizing systems; the development of 
pattern-recognition machines capable of accepting letters, pic- 10 
tures, and other graphic material as inputs; and the study of the 
possible applicabil’ *y of machine-translation results, including 
the mechanized sear. ing of non-English language documents. 


The best envisaged plan of research can break down at the 15 
most unexpected places. Additionally, unsoundly conceived or in- 
adequately considered steps in a program will not invariably ex- 
pose themselves early in subsequent development or use, and will 
require more time to reform than was "saved" in their establish- 
ment. The obviation is apparent -- sufficient funds to do the 20 
job, sufficient time to do it in, and sufficient skilled staff to 
do it properly .* 


Simon M. Newman 

Acting Associate Director 25 
Office of Research and Development 

U. S. Patent Office 
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b. THE INDUSTRIAL REVOLUTION IN LAW 





There is no man living in a modern civilized country whose 
life has not been deeply affected by the scientific and indus- 
trial revolution of the last two centuries. We eat pasteurized 
milk and frozen meats and vegetables, dress in clothes made of 
synthetic materials, listen to radio, watch television, con- 
verse on the telephone, travel by automobile and airplane, sleep 
on foam rubber mattresses beneath electric blankets, and gen- 
erally are so surrounded by the new products of science and 
industry that last year's miracles are today's banalities. The 
creat majority of our population works at jobs that were cre- 
ated by scientific and industrial changes during the last cen- 
tury, whether the individual function is in production, distri- 
bution or administration. Now these jobs are being altered or 
abolished by automation, and a new category of technical occu- 
pations is arising to provide service for the automatons. 


The professions, too, have generally changed in many sig- 
nificant aspects in response to the same influences. Today 
there are fundamental differences from twenty-five years ago 
in the practice of accounting, agriculture, architecture, en- 
sineering, journalism, and, perhaps above all, medicine and the 
health fields. There is one eminent and lonely exception--law. 
For all the multiplication of precedents, statutes and regula- 
ions, there has been no significant adaptation of legal tech- 
igues or practices to any of the new developments in science, 
technology or industry. Lawyers today are doing the same things 
in the same way that lawyers did a quarter, half and full cen- 
tury ago. Now we face the guestion whether this condition will 


v 


continue. 


+ fy ct 


Wnat will the practice of law be like a quarter of a cen- 
tury from now? 


So far as present knowledge supports a prediction, it will 
continue to be the same in many important respects as it is to- 
day. But it is even more certain that in many other significant 
aspects it will be quite different. Of course, the lawyer and 
nis practice will be affected, both directly and indirectly by 
the innumerable changes that will occur in the community pat- 
tern of living as the result of scientific and technological 
developzents. For example, it seems probable that typewritten 
letters may be transmitted across the country by electronic 
facsimile reproduction at least as rapidly as telegrams are 
today. 
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Of much greater significance to the legal profession is likely l 


to be the revolution in the methods of storing and retrieving in- 
formation, and the consequent changes in legal research and the 
daily work routine of the lawyer and judge. Based upon presently 
known techniques and devices now deing utilized in other fields, 
the following sketch of legal research and practice in 1985 can 
be projected as a conservative prediction. 


The lawyer's task of research in 1985, as in 1960, will start 
with his own analysis of the problems into legal issues and cate- 
gories. With some ideas of the categories in which answers to his 
questions may be found, the lawyer will turn to a one volume out- 
line of key numbers that will, or should, be on every lawyer's 
desk constantly. This volume will be large but not too unwieldy, 
approximately a single volume of the present American Digest or 
ALR series in size. This volume will have an outline, similar no 
doubt to the outline now contained in the American Digest sys- 
tem or Corpus Juris Secondum, but somewhat more comprehensive. 
However, the outline will contain only topical headings and a sys- 
tem of code numbers. There will be no entries under any of the to- 
pical headings, and thus the outline itself will not: become cut- 

ted rapidly as the present digest volumes do. 


The lawyer confronted with a legal problem will consult his 
digest volume for all the topical headings that he thinks are or 
might be relevant or helpful to his search for legal precedents 
or rules. From each digest heading he will secure a code key nun- 
ber which he will write down on a legal data requisition card, 
printed in an appropriate form for this purpose. 


After all the digest topic headings have been transferred 
to the data requisition card, the lawyer will consult a table 
printed on the inside front cover of the digest volume for the 
appropriate code symbol to indicate the kind of legal data he 
wants. By these symbols the lawyer will indicate whether he 
wants data under the digest headings relating to his subject 
matter for his own state jurisdiction alone, other specified 
state jurisdictions, or all of them, or for the federal juris- 
diction, and also whether he wants court decisions, statutes, 
administrative regulations, administrative precedents, or any 
combination of some or all of these. 


How long it will take to fill out such a card will depend 
upon the complexity of the problem and the number of subject 
headings involved, the facility of the individual, and the 
familiarity of the individual lawyer with the particular 
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ubject matter. In many cases no more than a few minutes may well 1 
suffice. In others, it may take somewhat longer, but hardly ever 
more than an hour at most in even the most complex and difficult 


matter. 


When the data requisition card has been completed the law- 
yer will give it to a messenger or an assistant, if he is ina 
hurry, or mail it to the local law library if he has plenty of 
time or resides out of town. The law library will consist pri- 
marily of several banks of electronic data handling machines. The 10 
librarian will take the data requisition card and transfer the code 
symbols to @ punch card on a small typewriter-like machine at her 
desk. (Larger law offices may have such a machine in the office and 
prepare their own punch cards to save time. ) The punch card will 
then be inserted into the control slot of the electronic "Law Di- 15 
gest Machine", a switch will be flipped, and there will be a slight 
um as a number of printed sheets slide into a receptacle on the 
ront of the machine. Each sheet will contain a photographic re- 


my pr 


production of the digest of each case, statute, regulation, admin- 
istrative precedent or other relevant data indicated as desired 20 
on the data requisition card which has been classified to any of 

the subject headings listed on the card. It should take less than 

two minutes to make the punch card, insert it in the machine and 
secure copies of digests of all relevant items. 


Tae machine itself will be merely an electronic digital conm- 
puver, much similar to numerous such machines in general use in 
many industrial and scientific applications. It will have a con- 
trol panel looking something like the console of an organ. Behind 
this will be a cabinet perhaps ten feet wide and ten feet high. 30 
The front several feet of the cabinet will contain an immensely 
inzricate maze and mass of switches, relays, transistors, diodes, 
and printed and solid state electrical circuits. This is the 
“analyzing” or “thinking” part of the machine. It will have no 
moving parts as all “action" will be electrical. This part of 35 
the machine will "read" the message on the punch card and actu- 
ate circuits corresponding to each subject heading and category 
of data sought. 


Behind the control panel and the electronic “brain" of the 4O 
machine, there will be a bank of reels, mounted side by side and 
ne above the other, and controlled by tiny electrical motors. 
Exch reel will be about half an inch wide and about two inches 
in diameter. It will contain many feet of an extremely thin plas- 
tic tape which has a special coating on it to provide a thermo- 45 
plastic record. Data is imprinted on the tape by writing with an 
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electrobeam that sweeps back and forth across the tape when re- l 
cording is done, leaving a groove similar to, but much finer than, 
the groove on a long-playing phonograph record. By electrical means, 
this groove is melted and then frozen into the tape, making a last- 
ing record. Each tape is capable of recording and storing the e- 5 
quivalent of several hundred volumes of ordinary printing. In a 

bank no more than half a foot in thickness and ten feet square 

there are about six thousand of these reels. Each six inch thick 

bank is the equivalent of a library of over a million volumes in 
stored information! 10 


As the electrical circuits actuate the reels and their con- 
trolling mechanism, certain reels containing the desired data 
turn to pass particular portions of their tapes over a “scanner”. 
The scanner “reads” the messages printed on the exposed parts of 15 
the tapes and transmits these to an image-reproducer which is 
much like an ordinary television tube. The fourth major component 
of the Law Digest Machine is the reproduction segment. Here the 
image-reproducer lights up brightly for a fraction of a second 
with a reproduction of the printed message recorded on the ex- 20 
posed portion of the tape. As this occurs, a photographic sheet 
is exposed and developed, and then dropped into the "out basket” 
on the control panel. It will be printed in a normal type face 
and be at least as legible as a good photostatic reproduction of 
1960. 25 


Thus in the time it takes the messenger to get a drink of 
water from the old-fashioned cooler in the corner of the room, 
the machine will have prepared printed copies of all digest par- 
agraphs that have been subsumed under any of the headings and 30 
categories in which the lawyer indicated an interest. These will 
be returned to the lawyer who can then read them at his conveni- 
ence. 


Some of the digests will certainly appear to be irrelevant 35 
and will be discarded. Others will probably be saved for future 
reference in the file. Some will certainly require further in- 
vestigation. These may refer to cases, statutes or regulatioas 
that are available in the lawyer's library of conventional books. 
However, as the number of decisions, statutes, regulations and 4O 
rulings continued to multiply it will become increasingly diffi- 
cult and expensive and commensurately less useful or profitable 
to attempt to maintain a reasonably current law library on the 
basis of conventional volumes. Consequently law office libraries 
everywhere will become smaller, rather than larger, as more and 45 
more reliance comes to be placed upon the central library with 





« 60 - 


its immense and efficient machines for the electronic retrieval of 1 
data. Therefore, it is likely that in many situations the digests 
will indicate important material to be examined in full that is not 
at hand in the office law library. But this will not involve any 
difficulty. 5 


Each digest will itself bear a code designation. The lawyer 
will note on another data requisition card the numbers of each 
digest referring to material he wants in full text. The messenger 
will take the second card back to the law library, and in a simi- 10 
lar brief period from essentially the same kind of machines, will 
obtain the complete text of each desired case, statute, regulation 
or other reference. The lawyer who desires to do so can, of course, 
take his own requisition cards to the law library; examine the di- 
gests immediately; secure the texts of such references as he de- 15 
sires at the time; and then return to his office for further stu- 
dy and analysis of the authorities thus secured. 


This system offers many obvious advantages over the cumber- 
some and inadequate system of legal research employed in 1960. 20 
Perhaps the most important advantage is that it offers the law- 
yer - for the first time in modern history - the opportunity to 
do a complete job of legal research on any given problem within 
whatever limits may be specified as relevant. The vast capacity 
of the electronic machines for storing and retrieving data vill 25 
permit classification, storage and retrieval in a systematic 
manner not Only of appellate precedents, but also of statutes, 
administrative regulations and administrative and executive 
rulings. Until electronic retrieval of data is a reality, ig- 
norance of at least administrative law will inevitably continue 30 
to be more the rule than the exception. 


Another advantage of electronic research will be the assur- 
ance that the results are up-to-date. There is no field of law 
in which the 1960 system of reports, digests, supplements and 35 
loose-leaf reporters gives real assurance that all of the re- 
cent cases, statutes and regulations will be notcd, even with 
extensive and arduous cross-checking. However, once a reference 
is reported and recorded in the Law Digest Machine, it will 
automatically come out in response to an input of the appro- 4O 
priate digest or code number. The most recent digests will be 
as guickly and easily accessible as the most ancient. 


Aside from the assurance it gives to lawyers of complete- 
ness and efficiency, the greatest advantage of the electronic 45 
retrieval system of legal research will be its ability to save 
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@ lawyer's time. Instead of looking through numerous volumes 1 
and supplements in order to secure authorities on a point, the 
search will be confined to a single, familiar outline digest 
that gives the code numbers needed to actuate the machines. From 
this point on, references are secured merely by pushing figura- 5 
tive electronic buttons for the digest numbers. Eventually, of 
course, the references must be read and analyzed; but the time 
spent in merely finding potentially relevant authorities will be 
reduced from hours to minutes. 

10 

As incidental advantages will be the convenience of having 

authorities furnished in separate, convenient, printed form. 
Additional copies will be equally available. Such copies may thus 
be marked as desired and filed with other papers without the ne- 
cessity of taking valuable secretarial time to make length ex- 15 
tracts from books. 


For the small town lawyer, the electronic retrieval system 
will be a real equalizer. Under the 1960 system a fairly exten- 
sive law library is needed for thorough research of any legal 20 
point. Such libraries are usually available only in the larger 
metropolitan centers. On the other hand, long distance research 
through a correspondent is difficult, expensive and usually un- 
reliable. But under the electronic retrieval system, the only 
difference between the lawyer most remote from the largest city 25 
and one living in it will be the time it takes for material to 
be transmitted by mail. The small town lawyer will have the same 
digest outline of key numbers as the metropolitan lawyer. He will 
use the same data requisition cards. And his cards will produce 
the same data from the same machines. 30 


The one advantage that the electronic retrieval system will 
not have is economy. The completeness, efficiency, speed, con- 
venience and availability of the system will have to be paid for. 
Undoubtedly payment will be made by those who use the system. 35 
The most obvious method of charging and collecting payment is by 
exacting a small fee for every page of data that is delivered 
on any lawyer's request. The charge could be as little as a frac- 
tion of a cent or, perhaps, as high as ten cents a page. It might 
well vary with the type of material sought, being different for 40 
digest pages, statutes and decisions. Presumably some arrange- 
ment for monthly deposit or charge accounts would be worked out, 
so that most lawyers would not have to keep a handful of petty 
cash around for every trip to the law library. It is impossible 
to estimate whether the aggregate cost of electronic legal re- 4S 
search will be more or less than the cost of maintaining a rea- 
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sonably useful office law library in 1960. However, it does 1 
seem clear that taking time and efficiency into account, the 

more modern system will be far more profitable for the lawyer - 

and his clients - than the cumbersome and inefficient hunt and 
check system presently in use. 5 


One aspect of legal practice that will not be changed merely 
by instituting a system for electronic retrieval of legal data is 
the thinking of lawyers. The revolution that we have been conten- 
plating is an industrial revolution--a change in techniques and 10 
patterns of work resulting from advances in thinking in other 
fields. This is not a scientific revolution--a change in the 
basic thought patterns in our own field. On the contrary, any 
system of electronic retrieval of data will necessarily be based 
upon 4 classification of data made before the system itself is 15 
put into operation. Thus it will have some tendency to carry for- 
ward whatever categories and cliches inhere in the thinking of 
lawyers when the system is established. This does not mean that 
the law will become fixed and immtable--any more than the system 
of digests and reports has made the law fixed and immutable. 20 
Indeed, it may well be that electronically recorded data can be 
more easily reclassified than that which is preserved in the con- 
ventional printed page. But there is nothing inherent in the 
electronic retrieval of data itself which will alter any legal 
concepts or procedures or make the legal system any more flex- 25 
ible or responsive in substantive principles to contemporary con- 
ditions. 


On the other hand, there are developments in modern logic and 
experimental science which at least suggest the desirability of 30 
new thinking in many areas of the law. We can only hope that as 
lawyers and judges are freed from some of the demands of routine 
and drudgery in the performance of their functions, they may have 
greater opportunity and impulse to devote themselves to the ef- 
fort of intellectual creation necessary to keep the substance of 35 
the law es modern as its techniques promise to become. 


Lee Loevinger 
Minnesote Supreme Court 
ho 
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c. THE MINICARD SYSTEM FOR HIGH SPEED DATA RETRIEVAL 1 





The MINICARD System is a high-speed information handling 
system conceived by Eastman Kodak Company and marketed by Recordak 
Corporation, Kodak's microfilm and office equipment subsidiary. 5 
The heart of the MINICARD System is the MINICARD film record on 
which information is photographically recorded to provide a "perma- 
nent" memory. (Fig. 1) This unit record is a tiny piece of micro- 
film, measuring 16 x 32mm. which provides for compact storage and 
automatic retrieval. The MINICARD film record has the advantage 10 
of coupling the document image and the index code as a single 
unit. It is this combination that both reduces the space requir- 
ed for storage and enables the high-speed searching which is 
characteristic of the system. Up to 12 documents--printed pages, 
charts, tables, photographs or other graphic material can be re- 15 
corded at 60:1 reduction on each MINICARD film record. In ad- 
dition to 12 images, the film also may contain 49 characters of 
code for indexing. By reducing the number of images, the amount 
of code on each unit record can be increased. For example, a film 
record with six images has space for 259 characters of code, and 20 
a full coded film record (no images) allows up to 455 characters. 
A binary code system is used which consists of a series of clear 
and opaque squares exposed on the filn. 


The MINICARD film records are handled on sticks, each stick 25 
holding a total of 2,000 film records. The film records contained 
on a stick are stored in magazines. There are 50 magazines in 
each file block, each magazine capable of storing one full stick 
of film records. The file blocks are stored in a 15-inch by 30- 
inch by 50-inch, three-drawer file cabinet whose drawers can 
each hold three file blocks. Thus, each file cabinet can store 
a total of 900,000 film records, which can contain a maximum of 
10,800,000 pages of documents along with 44,000,000 index code 
characters. By comparison, this amount of original material 
would occupy over 500 cabinets requiring 2,500 square feet when 
stored in standard files. 


© 


Ww 


Ww 
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In order to achieve tne photographic quality necessary to 
the success of the MINICARD System, a new film emulsion was 
created by Eastman Kodak. This fine-grain emulsion is of ex- 4O 
tremely high resolution--actual resolution of the raw film is 
more than 4 times greater than standard microfilm or over 1,000 
lines per millimeter. Also, new lenses of high resolving power 
and advanced film handling techniques were developed to fulfill 
the requirements of the system. 
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Approximately 14 photographic, electronic, mechanical and 1 
optical devices are required to record, code, store, sort, select, 
process and correlate information in the MINICARD Systen. 


Kodak has built four complete MINICARD Systems to date and 5 
has two Government contracts for additional systems. 


As the first step in setting up a file, the material must be 
indexed according to pre-established classifications. Cross- 
indexing of individual items can be extended as far as desired, 10 
since duplicate MINICARD film records can be automatically created 
in the MINICARD equipment for cross-filing. The document code is 
converted to a punched paper tape, read electronically and re- 
corded on the film. The related graphic material is then recorded. 
After the roll of film is processed in the MINICARD continuous 15 
processor and inspected, it is automatically cut into individual 
records and stacked on the handling sticks. These "first gener- 
ation" negatives make up a master file from which duplicates are 
made for use in expanded working files. A typical document might 
be recorded and stored in 5 to 10 places, enough to give suffi- 20 
cient complete and separate records to be filed according to as 
many different concepts as are mentioned in the document. These 
"second generation” MINICARD film records are sorted into maga- 
zines in the storage blocks as directed by the filing code which 
has been automatically recorded in the proper position on each 25 
expanded unit record. Therefore, since each file category con- 
tains a film record of every document related to it, only e small 
section of the files needs to be searched to locate specific in- 
formation. (Fig. 2 - Input) 

30 

Retrieval is accomplished in the following manner. Requests 
for information are coded and punched into tape. The working file 
category likely to contain this information is inserted into the 
selector. Using the tape and a control panel plugboard which in- 
dicates selection logic, an electronic selector scans the coded 35 
section of each MINICARD film record at a rate of 1200 cards per 
minute, selecting all those answering the request. A typical 
search usually takes about 5 minutes. 


One pass under the scanner reads the whole record. The LO 
equipment is designed to handle multiple record groups as if they 
were a single record permitting virtually unlimited capacity for 
associated document images and code. 


The selected records are collected in a separate magazine, 45 
duplicated automatically to produce "third generation” negatives 
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and then returned to the file. The "third generation" MINICARD 1 
film records go to the requestor who can study them in a viewer or 
get enlarged hard copy reproductions. In this manner the working 
file is always intact except for the few moments while the 

material is being searched. The "third generation" film records 5 
may be mounted on aperture cards to be kept as a personal file by 
the requestor or retained in original form on small handling 

sticks. These duplicates or the hard copy reproductions can be 
retained by the requestor in his own files or thrown away when he 

no longer needs them. (Fig. 3 - Output) 10 


As yet, an off-the-shelf price tag has not been established 
for individual components or for a complete installation. To 
establish a price for a particular application, it would be neces- 
sary to perform a systems study. If desired, the initial expense 15 
of a system could be split among several users. One way to do 
this would be to set up a centralized operation to handle the job 
of coding and preparing records. Duplicate film records could 
then be stored in several working or satellite locations. These 
might be associated with the central operation or might be sepa- 20 
rate organizations who would buy the service from the central 
user. Each of the satellite locations would then need only the 
searching system. 


Summarizing MINICARD System characteristics, we find: 25 


1. The MINICARD film record contains both graphic material 
and the digital code essociated with that material--a 
true complete unit record. 

2. The MINICARD film record combines the mobility of the 30 
tabulating card and the compactness of microfilm. 

3- Any kind of documentary information will reproduce on 
MINICARD film records. 

4. Single search retrieval at relatively high speeds pro- 
vides information which is immediately available for 35 
reference or for enlarging as original size paper copies. 

5. Cross-indexing classification is unlimited. 

6. Expendable duplicate film records are delivered thus 
keeping the working file intact. 

LO 

The MINICARD System is a modern, electronic microfilm medium 

for storage and retrieval of documentary information. 


Allen M. Rahm 
Recordak Corporation 45 
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a. STORAGE AND RETRIEVAL OF LEGAL INFORMATION: 
POSSIBILITIES OF AUTOMATION 








The following is a brief summary of portions of a study un- 
dertaken at Yale Law School under the sponsorship of the Walter 
E. Meyer Research Institute of Law. The first part of the study 
was aimed at making a useful estimate of the magnitude of the 
legal information-processing problem in the near and long-range 
future. The second part attempts a first approximation of a com- 
prehensive description of the legal research process. The goal 
of this description is 


1. to begin to distinguish (a) those parts of the legal 
research process which are routine enough to be ame- 
nable to automatic processing from (b) those parts 
which will continue to require human judgment because 
of their subtlety or complexity and 


2. to give an over-all picture of the legal research pro- 
cess into which the detailed descriptions of the vari- 
ous parts of the total process can be fitted, as these 
descriptions are produced in later studies. 


The first step in determining trends of growth in legal lit- 
erature is an investigation of what is being referred to by the 
term "legal literature”. The most comprehensive definition of 
“legal literature” would include as legal literature all docu- 
ments containing information which could be useful to some law- 
yer in his professional work. However, such a definition is so 
broad that it might well include just about all of the world’s 
written matter as legal literature. Perhaps, when the costs of 
storing information have been sufficiently reduced, the ideal 
law library will contain all information that has been written. 
But such is certainly not the case in the year 1960. 


However, even a bit of reflection shows that any definition 
of legal literature is arbitrary, if it sets up a dichotomy be- 
tween that literature which is and that which is not legal lit- 
erature. It might seem possible that, as an alternative to a 
dichotomous definition, all documents could be arrayed along a 
continuum, so that each document would be associated with a 
number indicating the degree to which that document is legal 
literature. In such an array, statutes, case opinions, rulings 
of administrative agencies and "legal" periodicals would tend 
to appear at the "high" end of the continuum; items such as 
arithmetic primers and cookbooks would tend to collect at the 
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"low" end. But a vast amount of material would inevitably lie in 1 
the broad middle of the continuum. If a “definition” is insisted 
upon, then once such an ordering is made, a point on the continuum 
could be selected--arbitrarily--and only those documents that are 
above that point "defined" as legal literature. Who is qualified 5 
to say where that point is? Moreover, even the very ordering of 
documents along such a continuum would be arbitrary. Who is qua- 
lified to determine this ordering? Perhaps an appreciation of some 
of the difficulties involved in attempting to define "legal lit- 
erature" will make it easier to understand why this problem has 10 
not been resolved (or even extensively pursued) in this present 


study. 


In light of the difficulties of specifying a workable def- 
inition of "legal literature”, the course taken was to look into 15 
a few areas which are undoubtedly part of legal literature in 
order to see what has happened to the rate of growth in these 
areas through time. The rates of production of the sampled 
statutes, case opinions, and legal periodicals are summarized 
in Figures 1-3. It should be emphasized that in Figures 1-3, 20 
it is rates of production that are represented, and not the 
accumulated totals of what has been produced. Figure 4, however, 
shows the total number of volumes in the Law Library of the U.S. 
Library of Congress at various times from 1832 to recent years. 

25 

Even a cursory examination of these four graphs suggests 
that if present trends continue, the information-processing pro- 
blem of lawyers in 1985 is going to be of no small magnitude. 

Perhaps when lawyers begin to be swamped by a deluge of litera- 

ture, they will come to a keener appreciation of the efforts 30 
being made currently by researchers and librarians in the na- 

tural sciences to devise methods for performing some parts of 

the research process automatically. 


Closer examination of the four graphs reveals important 35 
differences in the rates of increase in the rates of production 
of statutes and periodicals as contrasted with that of case 
opinions. For statutes and periodicals, the rates of increase 
are increasing, whereas the rates of increase for case opinions 
are more nearly constant. It is interesting to speculate whe- 4O 
ther the rate of production of legal periodicals will continue 
to follow the pattern of change for periodicals in other fields. 
For, notice that there seems to be a twenty-year lag between the 
bidlogy curve and the law curve, and a forty-year lag between 
the medicine curve and the law curve, but that the curves show 45 
close correspondences in shape. (There are growth curves in 
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agriculture, in engineering and technology, and in the natural 1 
sciences similar to those shown for medicine and biology (RI at 250) .) 


Even if the trends toward increased production of legal lit- 
erature during the past century do not continue during the next 5 
twenty-five years--even if the rates of increase level off com- 
pletely--there will, nevertheless, continue to be a build-up in 
the total accumulation of legal literature. 


There is, of course, the question of literature's becoming 10 
obsolete, and such obsolesence will tend tooffset this build-up 
of legal literature to be searched. But this question of Obso- 
lesence is another interesting and unresolved one. Just when is 
a case or a legal article sufficiently old that it is safe to re- 
gard it as being obsolete--safe to remove it from the “memory” of 15 
the computer so that it will no longer be available to be re- 
trieved? The difficulties with adopting any arbitrary criteria 
of obsolesence for use in an operating legal information-retrieval 
system will be apparent to every lawyer. Who knows with such cer- 
tainty what information is definitely not going to be helpful in 20 
advising a client, drafting a document, or persuading a judge? 


The pattern of growth of legal literature helps to show why 
problems of automatic retrieval of information should he of in- 
terest to lawyers. If computing machines are to be made useful 25 
in legal research, what is needed next is a detailed description 
of the process of storage and retrieval of legal information as 
now performed. Only when such a description is available will 
the computer specialists be in an advantageous position (1) to 
advise concerning which parts of the present research process 30 
can be automated or (2) to suggest entirely new ways of achieving 
the present goals of legal research. 


In describing the process of storing and retrieving legal 
information it is convenient to make use of diagrams, proceeding 35 
from a general description to a successively more and more de- 
tailed description. Figure 5 represents our most general de- 
scription of storing and retrieving legal information. 


is to be regarded as an input to the cesses) which it pre- 

cedes ( i.e.,7}+—>C_ _and and is to be regarded 40 
as an output from the Sere ence which it succeeds ( i.e. 

c>— y_} and __<—c>). An (val) represents a cess 


for which the LBOKES) axe. are inputs or outputs. Thus, in Figure 1 
there is A FOR SOME INFORMATION AS GUIDANCE FOR TAKING SOME 
ACTION TO HANDLE SOME SITUATION (abbreviated NEED) which is 45 
an input to the Research Process. Out of the Research Process 
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there are two kinds of conclusions as outputs: 1 
--A CONCLUSION TO STOP RESEARCH (abbreviated STOP RESEARCH) 
--OTHER CONCLUSIONS (abbreviated CONCLUSIONS). 

The CONCLUSIONS feed back as inputs to the Research Process. The 

description of the storing and retrieving of legal information 5 

given in Figure 5 is so general that it is not very useful. It 

does not indicate which of the various subprocesses of the Research 

Process are routine enough to be automated. 











One step toward a more detailed description is made in Fig. 10 
ure 6, in which the relationships between the various subprocesses 
and their inputs and outputs are shown, as well as the relation- 
ships between the subprocesses and the inputs and outputs to the 
Research Process. The O-1 NEED is an input to the 1. Definition 
of Problem Process, which has as its output a 1-2 STATEMENT OF 15 
THE PROBLEM (abbreviated PROBLEM). The 1. Definition of Problem 
Process with its inputs and outputs is the part of the Research 
Process in which the lawyer clarifies the goals that he is seeking 
in a particular situation, by considering (either implicitly or 
explicitly) the discrepancy between the current state of affairs 20 
and a preferred future state of affairs. 














The 1-2 PROBLEM is an input to the 2. Document Retrieval 
Process, which has as its output a 2-3 RETRIEVED DOCUMENT. The 
2. Document Retrieval Process along with its inputs and outputs 25 
is the part of the Research Process in which the lawyer finds the 
documents that he will read in handling a given situation. 














The 2-3 RETRIEVED DOCUMENT, the O-3 NEED and the 1-3 PROBLEM 
are inputs to the 3. Information Selection Process, which has as 30 
its output 3-4 SELECTED INFORMATION. The 3. Information Selection 
Process along with its inputs and outputs is the part of the Re- 
search Process in which the lawyer reads the documents that have 
been found and selects such information as 

--citations to other relevant documents, 35 

--information contributing to the solution of the problen, 

--information contributing to a reformulation of the request 

and 
--information contributing to a redefinition of the problem. 




















4O 
The 3-4 SELECTED INFORMATION and the 1-4 PROBLEM are inputs 
to the 4. Information Utilization Process, which has es its out- 
puts 4-1 CONCLUSIONS, 4-2 CONCLUSIONS and 4-0 STOP RESEARCH. The 
4. Information Utilization Process along with its inputs and out- 
puts is the part of the Research Process in which the lawyer uses 45 
the gathered information to choose what action to take in dealing 
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with the situation and to determine whether to and how to proceed 
in further research. 


Although certain distinctions are beginning to emerge which 
inguish processes that can be automated from those that can- 
-&-, one might suspect that parts of 2. Document Retrieval 
an be fone aut cuascen! y), the description given in Figure 6 is 
ill not fficiently detailed to be of much assistance. It also 


now sul 
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several eanghine | featuz 5: 

--Why NEED an input to 3. Information Selecticn? 

--Why is SELECTED INFORMATION an input to 2. Docu 
Retrieva 

--Why is 


ata 


--Why is 
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CONCLUSION an input to 1. Definition of Problem? 
CONCLUSIONS an input to 2. Document Retrieval? 
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hese features is facilitated by a more 
eiled de sonesthehen: Notice that, in moving from a general to a 
e detailed description of the storing and retrieving of legal 
ormation, in Figure 6 a component part of Figure 5 was analyzed 
its subcomponents. A still more detailed description of the 
age-and-retrieval proc ese results when the following parts of 
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2. Document Retrieval 
3-4 SELECTED INFORMATION 
4+. Information Utilization 
G=1 CONCLUSION 
4-2 CONCLUSIONS 
Tnis more detailed description is given in Figure 7. 

















The 2. Document Retrieval Process is represented in Figure 7 
by a combination of the following processes and their inputs and 
OUvL PUTS: 

O-25 INCOMING DOCUMENTS 

< Preliminary Analysis and Storing 

2E-2B DESCRIPTION FILE 

2E-2D DOCUMENT FILE 

















2A. Formulation of Request 

2A-28 REQUEST 

2e-28 DESCRIPTION FILE 

2B. Match Request With Descriptions 

23-20 CITATION 

2. Citation Selection 

2-2) MOST RELEVANT CITATION 

2E-2) DOCUMENT FILE 

2). Match Most Relevant Citation With Documents 
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A suitable combination of the first four parts listed above 
corresponds to what is usually regarded as a library. A suitable 
combination of the last nine parts corresponds to the process of 
retrieving documents from a library. 


The 3-4 SELECTED INFORMATION is represented in Figure 7 by 
the following five inputs and outputs: 

3-20 CITATION 

3644 INFORMATION THAT FAILS TO CONTRIBUTE TO SOLUTION OF 

THE PROBLEM 

Se INFORMATION CONTRIBUTING TO SOLUTION OF PROBLEM 

3- INFORMATION CONTRIBUTING TO REFORMULATION OF REQUEST 

3-4 INFORMATION CONTRIBUTING TO REDEFINITION OF PROBLEM 




















The 4. Information Utilization Process is represented in 
Figure 7 by the following combination of processes and their 
inputs and outputs: 

4A. Choice 

fA-kB SOME ACTION IN SITUATION 

48. Choice 

fc. Choice 

fc-ED SOME ACTION IN SITUATION 

1D. Choice 

GE. Choice 

fg-GF SOME ACTION IN SITUATION 














Finally, the 4-1 CONCLUSION is represented in Figure 7 by: 
4F-l REDEFINE PROBLEM 
and the 4-2 CONCLUSIONS are represented by: 
4B-20 CONTINUE RESEARCH 
fh-2c CONTINUE RESEARCH 
4F-20 CONTINUE RESEARCH 
4D-2A REFORMULATE REQUEST 























To see how Figure 7 helps in the construction of a concise 
description of the procedure of any lawyer in "manually" re- 
trieving legal information in order to handle a given situation, 
consider the following six examples. 


1 


10 


20 


25 


30 


35 


4O 


45 


Example 1 1 


Attorney Adams defines the problem in a situation he is han- 
dling, and then gets the most relevant document he can find. From 
reading this document, Adams 5 


a. Giscovers some information contributing to the solution 
of the problen, 


b. Gécides to take some action with respect to the given 10 
situation and 


c. decides to continue research. 


Using the alpha-numeric symbols associated with the pro- 15 
cesses represented in Figure 7, we can construct the following 
diagram showing the kinds of and temporal sequence of Adams’ 
research activities. 


20 
Diagram 1 

25 

1 --> 2A --> 2B --> 2C --> 2D --> 3 --> 4A --> 4B --- 

ok Le) c: 

L-3 2C 
30 
In interpreting Diagram 1, consider first the following part: 
35 
1 --> 2A --> 2B --> 2C --> 2D --> 3 

4O 


This part of Diagram 1 is intended to indicate: 


45 
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l. Adams defines the PROBLEM from the description of the + 
situation that has been presented to him. 


2A. Adams formlates a REQUEST for information. This he may 
do explicitly by external communication to another per- 9 
son (e.g., asking the librarian for all the articles 
dealing with the INCOME TAX effects of BEQUESTS to 
CHARITABLE ORGANIZATIONS that have appeared during the 
past 10 years in HARV. L. REV., COL. L. REV. or YALE 
L.J.), or he may make a private "request" to himself 10 
(e.g., by deciding simply to reach for his copy of the 
Internal Revenue Code and the Treasury Regulations). 


28. Adams (or his librarian) matches the REQUEST with the 
descriptions in the DESCRIPTION FILE (e.g., he looks 15 
up the word CHARITABLE ORGANIZATIONS in the card 
catalogue or some other index). 


2c. From the list of CITATIONS found by matching REQUEST 
with DESCRIPTIONS, the CITATION of the documant that 20 
Ought to be read first is obtained. The citation of 
that document is called the MOST RELEVANT CITATION. 


2D. Adams (or his librarian) matches the MOST RELEVANT 
CITATION with the documents in the DOCUMENT FILE, i.e., 25 
he uses the CITATION to locate the document. 


3. Adams readSthe retrieved document. 


The part of Diagram 1 just considered will be duplicated in the 39 
diagrams of each of the examples to follow. 


The part of Diagram 1 labeled ‘a' indicates that in reading 
the first document retrieved, Adams found some information con- 
tributing to the solution of the problem. The part labeled 'b' 35 
indicates that on the basis of this information he decided to 
take some action with respect to the situation. Finally, the part 
labeled 'c' indicates that Adams concluded that further research 
was needed, and so returned to the 2. Citation Selection Process 
to begin anew the process of obtaining further documents to read. 40 





Example 2 


Attorney Baker defines his problem and reads the most relevant 
document that he can find, but he 45 


@. does notdiscover any information that contributes to 








“wt 


a 


ee te eed 


-D- 
the solution of his problem (Nevertheless he mst stop 1 
research because his boss has given him a more important 
job to do now.), 


db. decides nevertheless to take some action with respect 5 
to the given situation and 


ce decides to stop research. 


By using Figure 7 the following diagram can be constructed for 10 
the second example. 


Diagram 2 
1 --> 2A --> 2B --5 & --> BD --> 3 --> 4A --> 4B --p 0 15 
a b Cc 


Diagram 2 is just like Diagram 1 except that here 4B feeds to 
O, i.e., the research process stops, whereas in Diagram 1 there 20 
is feedback to &, i.e., the research process continues. 


Example 3 
» « « Attorney Charles 25 


a. discovers some information contributing to a redefini- 
tion of the problen, 


b. decides to take some action with respect to the given 30 
situation and 


c. decides to redefine the problem and begin research on 
the new problem. 


35 
By using Figure 7 the following diagram can be constructed for 
the third example. 
Diagram 3 
aaernanionpeedaivane Lo 
1 --> 2A --> 2B --> XH --» BD --9 3 --9 HE --» UF --- 
a Le) e. 
Le» 1 
45 








~~ 


The most sigmificant feature of Diagram 3 is c, the long 1 
feedback from 4F al) the way back to the beginning of the Research 
Process at 1. What c indicates is that the output of the ¥F Choice 
Process is a conclusion to redefine the problem, which feeds back 


to 1. Definition of Problem Process. 5 








As a lawyer ecquires information in working on a problem, he 
often becomes aware that his initial requests for information need 
to be modified. This may be because the acquired information indi- 


cates that the initial definition of the problem is erroneous 10 
and in need of modification. In doing research with respect to 
@ given situation, there may well be a series of successive approx- 


“wVe ae 


the request. In the present "manual” form of legal research, there 
is a continual interaction between the lawyer and his "retrieval 15 


+2 


system". And it seems likely that any adequate system of re- 
trieval must provide for such continual interaction between 


lawyer and retrieval system--even an automatic one. Such inter- 
action is necessary so that what is asked of the system can be 
modified as the inflow of information enables the lawyer to 20 


f 
f 
wii " 
clarify his goals. 


+» « « Attorney Daniels 25 
a. discovers information contributing to a reformation 
of his initial request and 


ra 


bd. decides to reformlate the request without taking any 30 
action with respect to the situation, and to begin 
retrieving documents anew on the basis of the reformatted 


request. 


By using Figure 7 the following diagram can be constructed for 35 
the fourth example. 


Diagram & 
1 ==} 2A --3 2B --9 DH --3 D --> 3 --9 KC ---, ko 


a v! 


quam! 





poccesce--- <= -<- - 


45 
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Example 5 1 
« « e Attorney Evans 
a. discovers a new citation to a relevant document. 5 
Diagram 5 
1 ==} 2A =-> 2B --} 2 --> D --> 3 --5 
a} 10 
Cre" ewe 


When Evans is referred back to the 2. Citation Selection Process, 
he may choose either to continue reading the same document he had 15 
been reading when the new citation was encountered, or to read 

some other document. It is not inconceivable that the criteria 

for choosing citations in the 2. Citation Selection Process 

could be made sufficiently explicit that some or all of this 








process could be performed automatically. 20 
Example 6 
« « e Attorney Ford 
25 
@. discovers a new citation by reading the retrieved 
document, 
be. selects a new most relevant citation, 
30 
¢c. retrieves the document which matches that new most 
relevant citation, 
ad. discovers information contributing to redefinition of 
the problem by reading the retrieved document, 35 


e. decides to take some action with respect to the 
situation, 


f. decides to redefine the problem and begin research anew 40 
on the basis of the redefined problem, 


g- retrieves another document, 


h. discovers information contributing to solution of the 45 
problem by reading this most recently retrieved document, 








- 6 - 


i. decides to take some action with respect to the situ- 1 
ation and 


j- decides to stop research. 
y) 
Using Figure 7 it is easy to generate the following concise 
description of Ford's research procedure. 


Diagram 6 


10 
1 =-) 2A --} 2B --9 2 --> D --> 3 --5 
a 
<-> 2X --} D --> 3 --> ME --> UP --4 
b c d e f! 15 
Led 1 wad 2A mn} 2B an} DH on} QD ond 3 --> 4A ==> 4B --9 0 
g & € & € h i J 
20 


The usual pattern in doing research to handle a complicated situ- 
ation probably involves many more such cycles through the research 
process -- many redefinitions in order to adequately refine the 
problem, many reformulations in order to get the request that will 
produce the most relevant information, perhaps hundreds or even 25 
thousands of citations discovered and evaluated, and perhaps many 
intermediate actions taken to deal with the situation -- before 

any ultimate solution is determined and research is stopped. 


he several puzzling features of Figure 6 may seem somewhat 30 
less puzzling now that Figure 7 has been scrutinized closely in 
tracing through the examples just given. Since one of the out- 
puts of the 3. Information Selection Process is 3-4E INFORMATION 
CONTRIBUTING TO REDEFINITION OF PROBLEM, there must be criteria 
tor the selection of such information. Certainly, 0-3 NEED will 35 
be wanted as an input to 3. Information Selection for inclusion 
among such criteria. In Figure 7 it also becomes apparent why 
3-2 SELECTED INFORMATION (of Figure 6) feeds back to 2. Document 
Retrieval (of Figure 6); the 3-2C CITATIONS selected in the 
3. Information Selection Process are fed back into the 2D. Cita- 40 
tion Selection Process. Similarly, the reasons why 4-1 CONCLUSION 
is an input to 1. Definition of Problem and 4-2 CONCLUSIONS an 
input to 2. Document Retrieval are evident from an inspection of 


Figure 7. 



































45 
The degree of detail in Figure 7 provides a description of 
the storing and retrieving of legal information that is at last 
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beginning to look interesting from the viewpoint of distinguish- 1 
ing processes that can be automated from those that require human 
intervention. It has been mentioned that perhaps some or all of 

the 2. Citation Selection Process could be done automatically -- 

if the criteria for selection could be made sufficiently explicit.5 
The processes labeled with an '** in Figure 7 are ones that can 

be done automatically in some retrieval systems, e.g., 2A. Form- 
lation of Request, 28. Match Request with Descriptions and 

2D. Match oor Relevant Citation with Documents are automated in 

@ system proposed by IBM researchers (R2). It should be men- 10 
tioned also that nearly all of GE. Storing and Prelimin 

Analysis can be done automatically in thts same LEM retrioval 

sys « What is yet needed to make this process wholiy automatic 

is a general purpose scanner for transliterating prose into a 
machine-readable form. It should be noted that this IBM retriev- 15 
al system seems to put much emphasis upon the frequency of occur- 
rence of certain of the words in a document and relatively little 
emphasis upon the relationships between these words. (See 

Newman*s comment on this with respect to patent~search problems, 
MULL 603187.) 20 











Observe that it is the 2. Document Retrieval Process that 
the parts that seem most susceptible to being automated occur. 
It is somewhat difficult to conceive how mich, if any, of the 
1. Definition of Problem Process can at the present time be dele- 25 
gated to a machine. The same comment applies to the 3. Informa- 
tion Selection Process, with one small qualification: recognition 
of citations can probably be done automatically. 











We have presented our first approximation of a comprehensive 30 
description of the storing and retrieving of legal ‘nformation. 
Hopefully, this description will stimlate comments that will aid 
in revising it to eliminate errors and amissions. T&en, the re- 
search task for the future is the identification of subcomponents 
of the parts distinguished in Figure 7, and only after that can 35 
it be ascertained which subcomponents will next surrender to the 
wizardry of the electronic age. 


Layman E. Allen, 

Robin B. S. Brooks and Lo 
Patricia A. James 

Yale Law School 


45 
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3. REVIEWS AND ABSTRACTS l 
a. Machine Data Processing Systems For The Trial Lawyer 


by Roy N. Freed, THE PRACTICAL LAWYER, 
April 19600, p. 73. 5 





(Author's Abstract) 


This article describes a newly devised technique for manag- 
ing large accumulations of trial evidence in major litigation by 10 
means of a punch card machine data processing system. The tech- 
nique requires a careful classification of the various facets or 
each case, based in large measure on the underlying trial strate- 
gy- Each item of evidence is coded in terms of the categories 
thus established, for retrieval as the litigation progresses. 15 
The greater effort in setting up a system of classification than 
normally might be expended probably will be rewarded in terms of 
better case preparation. To facilitate the process, forms are 
suggested by which the lawyer simultaneously analyzes each item 
and conveys coding instructions to the card puncher. 20 


As advantages of the new technique, it is claimed that the 
ttention of lawyers to non-legal details will be minimized and 
that the ready aveilability of all items of evidence will be 
assured despite personnel changes. The great expense of trial 25 
record searching by lawyers will be reduced, and the failure to 
discover important facts because of human perception failings 
should be eliminated. 


Because only a card punch and a sorter are required, the ex- 30 
pense of such a technique is comparatively low and might be less 
than savings accomplished by it. 


It is suggested that the technique be utilized by courts and 
administrative tribunals as well as by the litigants as a means 35 
of reducing the time and effort required for reaching decisions 
in large cases. Such a use would be more likely if opposing 
counsel could agree on an index of the record in terms of which 
the tribunal's deck of cards could be set up but which would 
not disclose the trial strategy of any party. 40 


The article attempts to acquaint lawyers with the mechanics 
of machine data processing in the hope that their awareness of its 
intricacies and of its substantial benefits will stimulate greater 
efforts toward the performance of legal research with the aid of 45 
@ more sophisticated, fully electronic variety of such a machine 
system. 
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6. LEQTERS OF READERS AND OTHER MISCELLANEOUS ITEMS 1 


a. POLICY STATEMENT OF WEST PUBLISHING COMPANY ON AUTO- 
MATIC RETRIEVAL OF LEGAL INFORMATION 








In a letter to the editor of MULL, William W. Marvin, Vice 
President of the West Publishing Company, enclosed the following 
statement of policy on West's present position with regard to 
automatic retrieval of information in lav. 

10 
"What the future may hold in this day of inventive ad- 
vances, we do not attempt to foretell. Our findings, as of to- 
day, convince us that although electronic data processing com- 
putors have been developed to an incredible degree, they still 
do not, as of now, offer a practical medium for legal research. 15 


"There have been several attempts to adapt microfilm to 
current legal publications in daily use. Here, too, present day 
developments are not adequate for everyday general library re- 
search which is the real goal. We could not ask our subscribers 20 
to adopt, at a very substantial cost, a process still in an ex- 
perimental stage. 


"Our responsibility as major publishers obligates us to 
consider the needs of the profession both collectively and in- 25 
dividually and we are not closing our minds to possible future 
developments. The nature of the legal processes with the many 
shades of distinction do not lend themselves to adequate clas- 
sification through automation by any present systen. 

30 

"Several opportunities to explore the development of a 
project similar to that you suggest have been proposed to us 
and we have not been deterred by cost or technical difficul- 
ties but rather that our study of electronic processing com- 
putors have shown very clearly that they would not be ade- 35 
quate nor acceptable to the legal profession. 


"Several bar association committees have made studies 
and contacted us on the subject. We have advised them that 
West Publishing Company is continuing its study of all pos- 4O 
sible methods of improving and shortening legal research but 
there would be no justification in our offering to the legal 
profession a plan of mechanical storage and automatic re- 
search of the law that employs any present electronic conm- 
putors, tape recorders, or other mechanical devices." 45 
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In his letter Mr. Marvin stated further: 


"We assure you that West Publishing Company is giving con- 
stant thought and study to the subject and in this day of sci- 
entific developments, anything may be possible for the future; 
but today there is no development that offers a practical plan 
that would have the support and acceptance of the legal pro- 
fession.” 


b. PROGRAM OF THE EDR COMMITTEE AT THE 1960 ANNUAL 
MEETING OF THE ABA IN WASHINGTON, D.C. 








Demonstration of the Use of Electronic Data Processing Sys- 
tems in Legal Research 


jw 
. 


Speaker: John F. Horty, Director, Health Law Center, Uni- 
versity of Pittsburgh, and others 


Demonstrations of the application of electronic data pro- 
cessing systems to legal research will be presented under 
the joint sponsorship of the EDR Committee and the Uni- 
versity of Pittsburgh. 


2. Demonstration of the Use of Electronic Data Processing 
Equipment in Patent Office Searching 


Speaker: Dovald D. Andrews, Director, Research and De- 
velopment, United States Patent Office, and 
others 


Demonstrations of the application of electronic data pro- 
cessing equipment to Patent Office searching will be pre- 
sented under the joint sponsorship of the Electronic Data 
Retrieval Committee and the United States Patent Office. 


Both of the foregoing demonstrations will be made periodically 
every day during the Convention, commencing Saturday, August 
27. Because of the limited space available at IBM headquarters, 
it will be necessary to obtain tickets for reservations for 
specific demonstrations at the Registration Desk at the Hotel 
Statler. Bus transportation will be available to the site of 


the demonstrations at IBM headquarters at 1220 Nineteenth Street, 


N.W., in Washington, D.C. 
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3. Meeting of the EDR Committee, Monday, Aug. 29, Congressional 
Room, Hotel Statler, 9:00 AM to 12:00 AM. 


9 :00 INTRODUCTION 


GENERAL REMARKS 


Cecil E. Burney, Chairman of the Bar 
Activities Section, Member of Texas 
Bar, Corpus Christi, Texas 


Richard F.C. Hayden, Chairman of 
El.ectronic Data Retrieval Committee, 
Member of California Bar, Los Ange- 
les, California 


9:10 "Experience with the Application of Electronic Data Pro- 
cessing Systems in General Law" 
John F. Horty, Member of Pennsylvania Bar, 
Director, Health Law Center, University 
of Pittsburgh. 


9:35 "Experience with Electronic Searching of United States 


Patents" 


Donald D. Andrews, Member of District of 
Columbia Bar, Director of Research and 
Development, United States Patent Office. 


10:00 PANEL 
DISCUSSION 


Moderator: 


Panelists: 


"Progress and Problems in the ap- 
plication of Electronic Data Pro- 
cessing Systems to Legal Research" 


Reed C. Lawlor, Member of California 
Bar, Los Angeles, California 


Vincent P. Biunno, Member of New 
Jersey Bar, Newark, New Jersey 


Vincent Fiordalisi, Professor of 
Law and Law Librarian, Rutgers 
University 


Allen Kent, Associate Director, 
Center for Documentation and Com- 
munication Research, Western Re- 
serve University 


H. Peter Iuhn, Advisor, Business 
Intelligence Systems, Advanced Sys- 
tems Development Section, Interna- 
tional Business Machines Corpora- 
tion, Poughkeepsie, New York 
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The foregoing panelists will discuss legal research methods 1 
in general, the capabilities of data processing systems for legal 
research, the history of the application of mechanized techniques 
to legal research, and the economics and advantages of mechanized 


legal research. 
Reed C. Lawlor 


Chairman of the 1960 Program 
of the EDR Committee 


¢. PRACTICING LAW INSTITUTE COURSE DEALING WITH 
INFORMATION RETRIEVAL 








The Practicing Law Institute (20 Vesey St., New York 7, N.Y. )15 
has announced a course on current problems in patent law. The 
session scheduled for 7:30 PM, Monday, July ll, at the Statler- 
Hilton Hotel in New York City, is entitled "Information Retrieval". 


It is described as follows: 
26 


"Storing and retrieving information as mechanical 
aids in searching; the patent lawyer's interest; 


practical aspects and ultimate objectives.” 
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POND pp tacts L_ 


University of Michigan 
Law Library 
Aan Arbor, Michigan 


Citations to MULL 


To keep citations brief, the following system will be used in 
making citations to MULL. Example: MULL 60M6c 


Volume number -- coincides with year of publication 
Month of publication -- M March S September 
J June D December 





Section number 
Subsection number 
Page number (if necessary) 


The first issue of MULL appeared in September 1959. 


Since one of the major purposes of MULL is to provide 
a vehicle for the exchange of information about 


1. the uses of modern logic in law and 


2. modern methods of information retrieval 
in law ? 


readers are urged to send in news about research in 
progress that will be appropriate for inclusion in 
MULL. ; 
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